From Document to Entity Retrieval: Improving Precision and Performance of Focused Text Search

Author

  • Henning Rode
Abstract

Dissertation to obtain the degree of doctor at the Universiteit Twente, on the authority of the rector magnificus, prof. dr.

To my grandfather, who should have gotten a PhD long before me.

Acknowledgments

Writing a thesis that sums up my scientific work of four years was a new experience for me. First of all, it demanded quite some patience from me. Instead of looking forward to new scientific challenges, it forced me to re-read, rethink, and rewrite what I had done before. The confrontation with the past brought up old ideas, scientific plans, things I did as well as things I never found the time to do. And, last but not least, it made me think of all the people who accompanied me through that period and made it an exciting, enjoyable time.

First, I'd like to thank my supervisor Djoerd for all his detailed reviewing work on this thesis and on my other scientific writing, which improved the presentation "by far". But also for the nice working atmosphere we had during the whole period of my PhD, and for just being around for all kinds of questions and discussions, starting on work issues but not always ending there.

There have been many more people, though, who contributed to this research work. My promoter Peter always tried to keep me on track; without him I would probably not have finished my PhD in time. … who did an excellent job in reviewing my scientific work. All those people gave much fruitful input to my own work, and at the same time taught me to defend my own writing. I also want to thank the database group at the UT for the good working environment and the friendly atmosphere, and our soup cooperation for providing at least the remembrance of a warm lunch.

To pick out a few people: It was Maurice who had the brilliant idea to ask me whether I would like to come to the Netherlands at a time when I was not really thinking of doing a PhD. Developing our own search system PF/Tijah would not have been as successful and fun without our scientific programmer Jan, who helped me a lot with my code work when he was not climbing mountains at the remotest places of the world. Further, Sandra, Ida, and Suse could hardly have done more to support …

Similar resources

Improving Precision of Keywords Extracted From Persian Text Using Word2Vec Algorithm

Keywords can present the main concepts of the text without human intervention according to the model. Keywords are important vocabulary words that describe the text and play a very important role in accurate and fast understanding of the content. The purpose of extracting keywords is to identify the subject of the text and the main content of the text in the shortest time. Keyword extraction pl...

IIT TREC 2006: Genomics Track

For the TREC-2006 Genomics Track, we report on the effectiveness of composite information retrieval functions based on a dimensional data model for improving document, passage, and aspect search precision of genomics literature. We designed an approach, and developed a corresponding search engine, based on a novel dimensional data model capable of document, paragraph, sentence, and passage leve...

The State-of-the-arts in Focused Search

The continuous influx of various text data on the Web requires search engines to improve their retrieval abilities for more specific information. The need for relevant results to a user’s topic of interest has gone beyond search for domain or type specific documents to more focused result (e.g. document fragments or answers to a query). The introduction of XML provides a format standard for dat...

Document Image Retrieval Based on Keyword Spotting Using Relevance Feedback

Keyword spotting is a well-known method in document image retrieval. In this method, search in document images is based on a query word image. In this paper, an approach for document image retrieval based on keyword spotting is proposed. In the proposed method, a framework using relevance feedback is presented. Relevance feedback, an interactive and efficient method, is used in this paper to imp...

Experiments with Geographic Evidence Extracted from Documents

For the 2008 participation at GeoCLEF, we focused on improving the extraction of geographic signatures from documents and optimising their use for GIR. The results show that the detection of explicit geographic named entities for including their terms in a tuned weighted index field significantly improves retrieval performance when compared to classic text retrieval.

Multimodal Medical Image Retrieval: Improving Precision at ImageCLEF 2009

We present results from Oregon Health & Science University’s participation in the medical retrieval task of ImageCLEF 2009. This year, we focused on improving retrieval performance, especially early precision, in the task of solving medical multimodal queries. These queries contain visual data, given as a set of image-examples, and textual data, provided as a set of words belonging to three dim...


Publication date: 2008